Complexity and universality in the long-range order of words

Authors

  • Marcelo A. Montemurro
  • Damián H. Zanette
Abstract

As is the case of many signals produced by complex systems, language presents a statistical structure that is balanced between order and disorder. Here we review and extend recent results from quantitative characterisations of the degree of order in linguistic sequences that give insights into two relevant aspects of language: the presence of statistical universals in word ordering, and the link between semantic information and the statistical linguistic structure. We first analyse a measure of relative entropy that assesses how much the ordering of words contributes to the overall statistical structure of language. This measure presents an almost constant value close to 3.5 bits/word across several linguistic families. Then, we show that a direct application of information theory leads to an entropy measure that can quantify and extract semantic structures from linguistic samples, even without prior knowledge of the underlying language.

Journal:
  • CoRR

Volume abs/1503.01129

Publication date 2015